Rank Multimodal (RankMM)
The RankMM model combines three search signals (a text query, page context, and images) to aid image and video retrieval. RankMM models are Visual Language (VL) models that take page context into account to improve image and video retrieval performance in a web-scale search engine.
Enhancing image quality is a critical but challenging computer vision task. We've released a new AI-based model for improving the quality of images on Microsoft Bing. Not only are the Bing image search results relevant, but they are also beautiful and high-resolution. Our new V3 model outperformed the V2 model by 36% in terms of click-through rate, demonstrating that users find visually appealing images in search results more interesting and...
On the heels of the recently shipped Bing Visual Search, we are taking another step to streamline the experience: Bing now automatically detects and suggests objects to search for. The feature is called Bing Object Detection. Read on to find out how it works and how we made it happen.
Over the years, people have come to expect search engines to automatically detect intent and provide great search results for text queries typed into a single search box. Now Bing is taking the first step toward the same for images. The Bing Team sets out to connect your camera to a deep search experience.